Word boundary detection using landmarks: a survey of consonants

نویسنده

  • Xuemin Chi
چکیده

This project searches for consistent acoustic attributes in a broad set of American English consonants that would help in identifying their word positions in running speech. A database of sentences containing word pairs (e.g. "lay keys" vs. "lake ease" for /k/) of thirteen consonants (six stops, two affricates, three fricatives, and two nasals), controlled for prosodic boundaries, pitch accents, phonetic contexts, and word positions (initial vs. final), was recorded from six speakers. On the assumption that consonants might be articulated differently at word onsets, several temporal and spectral measurements were made and contrasted as a function of word position. The relatively simple measurement of duration did quite well in distinguishing word-initial (being longer) from word-final positions in our database. For stops and affricates at word onsets, speakers are found to lengthen closure and release durations differently, depending on voicing, suggesting that enhancement of paradigmatic contrast is made for these consonants. The identity of the following vowel (/i/ or /o/) had no consistent effect on the durations of the consonants. Word-initial consonants were found to be less variable than word-final ones, supporting the claim that word onsets are perceptual "islands of reliability" in the lexical access process. Durations of word-onset consonants were relatively constant within each sound class (voicing, stops, affricates, fricatives, nasals), independent of place of articulation. By using acoustic landmarks, from which information about manner as well as durations can be easily extracted, word segmentation and/or lexical access processes can start without the complete identification of all features (such as place) for a particular segment. Acoustic landmarks can thus be used either singly, in identifying acoustically interesting regions where place features can be identified, or in combinations, from which manner features (Park, 2008) and temporal relations can be derived, to drive higher-level processing (e.g. word segmentation and lexical access) of the speech signal. Thesis Supervisor: Kenneth Noble Stevens Title: Clarence J Lebel Professor Of Electrical Engineering

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Word segmentation in Persian continuous speech using F0 contour

Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...

متن کامل

مدل‌سازی بازشناسی واجی کلمات فارسی

Abstract of spoken word recognition is proposed. This model is particularly concerned with extraction of cues from the signal leading to a specification of a word in terms of bundles of distinctive features, which are assumed to be the building blocks of words. In the model proposed, auditory input is chunked into a set of successive time slices. It is assumed that the derivation of the underly...

متن کامل

Automatic Optic Disc Center and Boundary Detection in Color Fundus Images

Accurately detection of retinal landmarks, like optic disc, is an important step in the computer aided diagnosis frameworks. This paper presents an efficient method for automatic detection of the optic disc’s center and estimating its boundary. The center and initial diameter of optic disc are estimated by employing an ANN classifier. The ANN classifier employs visual features of vessels and th...

متن کامل

The contribution of obstruent consonants and acoustic landmarks to speech recognition in noise.

The obstruent consonants (e.g., stops) are more susceptible to noise than vowels, raising the question whether the degradation of speech intelligibility in noise can be attributed, at least partially, to the loss of information carried by obstruent consonants. Experiment 1 assesses the contribution of obstruent consonants to speech recognition in noise by presenting sentences containing clean o...

متن کامل

Ambisyllabic consonants as foot-medial onsets

Background The syllabic affiliation of ambisyllabic consonants is unclear (e.g., placid ["plæsId] & limit ["lImIt]). Standard analyses argue for their simultaneous linkage to the preceding and following syllables (Kahn 1976; Kenstowicz 1994). The analysis has been argued to receive support from meta-linguistic syllable boundary judgement tasks (Derwing 1992; Treiman and Danis 1988). However, sy...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008